HighProbability determines which alternative hypotheses are sufficiently probable: Genomic applications include detection of differential gene expression

نویسنده

  • David R. Bickel
چکیده

Many genomic experiments, notably microarray experiments seeking to detect differential gene expression, involve calculating a large number of p-values. This leads to the multiple testing problem: when the number of null hypotheses is large, the probability of accepting at least one false alternative hypothesis is often much greater than the significance level of the tests, which tends to mislead investigators. Software called HighProbability provides a simple, fast, reliable solution to the multiple testing problem, with applications to many areas of bioinformatics. For example, in a microarray study, HighProbability can determine which genes are probably differentially expressed. Given a set of p-values not adjusted for multiple testing, HighProbability determines which ones are low enough to imply a high probability of the truth of their alternative hypotheses. The set of p-values may be determined by conventional hypothesis testing or by random permutations using existing R or S-PLUS software. HighProbability is freely available under license through http://www.davidbickel.com. Coded in S, HighProbability currently requires an installation of R or S-PLUS, but the algorithm is short enough for fast implementation in non-S languages as well.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis

Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...

متن کامل

Shrunken p-values for assessing differential expression with applications to genomic data analysis.

In many scientific problems involving high-throughput technology, inference must be made involving several hundreds or thousands of hypotheses. Recent attention has focused on how to address the multiple testing issue; much focus has been devoted toward the use of the false discovery rate. In this article, we consider an alternative estimation procedure titled shrunken p-values for assessing di...

متن کامل

Expression Analysis of RNA-Binding Motif Gene on Y Chromosome (RBMY) Protein Isoforms in Testis Tissue and a Testicular Germ Cell Cancer-Derived Cell Line (NT2)

a key factor in spermatogenesis and disorders associated with this protein have been recognized to be related to male infertility. Although it was suggested that this protein could have different functions during germ cell development, no studies have been conducted to uncover the mechanism of this potential function yet. Here, we analyzed the expression pattern of RBMY protein isoforms in test...

متن کامل

Molecular detection of proteolytic activity of human parechovirus 2A protein by gene expression

  Parechoviruses form one of the nine genera in the picornaviridae family, and include two human pathogens: Human parechovirus type1 and 2 (Hpev1 and Hpev2). The genome of picornaviruses encodes a single polyprotein, which undergoes a cleavage cascade performed by virus encoded proteases to give the final virus proteins. The primary cleavage occurs by 2A protein and this step is critical for vi...

متن کامل

Detection of the “Tim” gene of sheep Giardia using “Tim” Gene primers of Giardia with human origin

Giardiasis is an important human parasitic disease. Giardia is a genus composed of binuclear flagellate protozoa. Giardia duodenalis is a parasitic species for a wide range of vertebrates, including humans. Heterogeneity in G. duodenalis has been shown by serological, biochemical, and molecular analysis. In the present study, the possible genetic similarity between Giardia in sheep and humansan...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004